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PATENT APPLICATION: US/1 0/0 3 0 , 82 9 



DATE: 09/12/2002 
TIME: 15:54:19 
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Input Set : A:\100303829SeqList.txt 
Output Set: N:\CRF4\09122002\J030829, 

Christophe 



raw 



APPLICANT : Beclin, 
Elmayan, Taline 
Vaucheret , Herve 

TITLE OF INVENTION: NOVEL SGS 3 PLANT GENE AND USES THEREOF 

FILE REFERENCE: A34920- PCT-USA 072667.0179 

CURRENT APPLICATION NUMBER: 10/030,829 

CURRENT FILING DATE: 2002-0.1-11 

PRIOR APPLICATION NUMBER: PCT/FR/0 0/0 2 0 5 2 

PRIOR FILING DATE: 2000-01-26 

PRIOR APPLICATION NUMBER: FR 99/09,417 

PRIOR FILING DATE: 1999-07-16 

NUMBER OF SEQ ID NOS : 5 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO: 1 
LENGTH: 3275 
TYPE: DNA 

Arabidopsis thaliana 



D 



ORGANISM : 
FEATURE: 
NAME/KEY : 
LOCATION: 



primer _b ind 
(693) . . . (715) 
OTHER INFORMATION: p356AD' 
NAME/KEY : primer_bind 
LOCATION: (2926) . . . (2952) 
OTHER INFORMATION: P356Y' 
W--> 37 <400> 1 

gacaaacaaa caaaaattaa gcaagtcatg ttcgtagcaa taaattaata gtgggaacaa 

ttaagttaag cgaaaaagga aaaaaaaagg tacaaaaatg aaaacaaaat caaactgaat 

gaaaatttgg agtccagaat cggaaaaacg aggccgtttt agagcttaat aagcttcctc 

atttgtctct tcttcgtcag tttan-ttct tcctcoggag tcctgactca ctact.ct:ac 

tctojggcgc tttaaactta cgttctccgt cgLttactct gtaagttttc tgccttagag 

cetccgatcg cctcaccgca tgcattctgt gctcgatttc tctttttctt cgctggaaaa 

attgocctaa tgttctcgat ttcgaaggtt tttgtgctat gggttacttt tttccctata 

ttttatagtt cttaggtaac gatacctgcg tcttactgtt tttgttcatt ttgttgtgct 

ttnaocgttt agtcgctgat cggagtattt gactgtgaaa aatccttcgt tttttggttt 

ttqtttcata taaatcggat tgatctacct tttgtgcttt gatgtttgtt ttttgagcct 

atticgttgtt ggcttgttat aacttcacgt tcatgtgtgg attttgagat tttggtagtg 

actgtgggtt tctttggtgg ctataggttg taaaaatgag ttctagggct ggtccaatgt 

ctaaqgaaaa gaacgttcag ggtggttata ggcctgaggt tgaacagttg gttcaaggtt 

tgqcagggac gagactggct tcttcacaag atgatggagg agagtgggag gtcatttcca 

agaaqaacaa gaacaaacca ggaaacactt ctggaaaaac ttgggtttct cagaattcga 

atcctcctag agcttggggt ggtcagcagc aagggagagg tagcaacgta tctgggagag 

gaaacaatgt atccgggaga ggtaacggca atggtcgggg cattcaagct aacatatctg 

gtcggggacg agcgttgagc agaaagtatg ataacaactt tgtggcaccc ccacctgtat 
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RAW SEQUENCE LISTING 

PATENT APPLICATION' US/10/0 30,82 9 



DATE: 09/12/2002 
TIME: 15:54:19 



Input Set : A:\100303829SeqList.txt 
Output Set: N:\CRF4\09122002\J030829.raw 



<213> ORGANISM: 
<220> FEATURE: 

0 <221> NAME/KEY 

1 <222> LOCATION 
3 <400> SEQUENCE 



CDS 

(1) • 

2 



(1878) 



5b ctcgccctcc tttggaagga ggatggaatt ggcaggcaag aggaggttct gctcagcaca 1 

57 cagctgtgca ggagtttcct gacgtggagg atgatgtgga taatgcttct gaggaagaga 

58 atgattccga tgctttggat gattctgatg acgaccttgc aagtgatgat tatgactcgg 

59 atgtgagtca aaagagccat ggatcacgaa agcagaataa gtggttcaaa aagttctttg 

60 gcagcttgga tagcttgtcg atcgagcaga taaatgaacc acagaggcag tggcattgtc 
6.1 cagcttgtca gaacggacct ggtgccatcg attggtataa cctgcaccct ctactagctc 1 

62 atgcgaggac aaaaggagct aggcgagtta agctccatag agaattggct gaagttttag 1 

63 aaaaggatct acagatgaga ggcgcatctg tcattccttg tggtgagatt tatgggcagt 1 

64 ggaagggttt gggtgaggat gaaaaggatt atgaaattgt ctggcctcca atggtcatca 1 

65 tcatgaatac tagactggat aaggacgata acgataaggt ggaattcttc tgtcttttac ]. 

66 ttctttaatt tttctcttgc attctactga tcttagaatg ttacattgta gtggctcggc 1 

67 atgggcaacc aagagctgct ggaatacttc gacaagtatg aggctcttag agcacgccat 1 

68 tcctatggtc cacagggcca tcgtgggatg agtgttctga tgtttgagag cagtgccact 1 

69 ggctatttgg aggccgaacg cctccaccgg gagttagctg agatggggtt agatagaatt 1 

70 gcctggggtc agaagcgcag tatgttttct ggaggtgttc gccaactgta tggcttcctt 1 
7 1 gcaacgaagc aagatctgga catattcaat caacactctc aaggttctct cccccaaaga 
72 aatttgatat atgcttttag ttttgtcatt ggaatttaaa gttttgttgg tccgtgttaa 2 
7 3 tgcatctgtt atgtatatat ctatgattca ttaggcaaaa caaggctgaa attcgagttg 2 
74 aaatcatacc aagagatggt tgtaaaggag ctgaggcaga tctctgagga caatcagcag 2 

7 5 ctgaactact ttaagaacaa gctctcaaaa cagaacaagc acgccaaggt gcttgaggaa 2 

76 tctctggaaa ttatgagcga gaagctgcgt agaactgcag aggataatcg gatcgtgaga 2 

77 cagagaacta agatgcagca tgaacagaac agggaagagg tatgattttt cctagaaaat 2 

78 cacaaacttg acattttgta ttacctactg attcacattt ttgattatat tgtccaacaa 2 

79 aaaacctgtg gtggtttgaa gatggatgca cacgacaggt ttttcatgga tteaatcaaa 2 

80 cagatccatg aaagaagaga cgcaaaggag gagaatttcg agatgttgca gcagcaggaa 2 

81 cgtgccaagg ttgttggeca gcagcagcag aacattaatc cctctagcaa tgacgattgc 2 

82 cgaaagaggt atatgtacta actaacataa tccctctggc gtttttgttt ttcaaaccta 2 

8 3 agagtaactg aattattccg gttttgattc tttcgcagag ctgaggaagt gtcaagcttc 2 

84 atcgagtttc aagagaaaga gatggaggag tttgtggaag agagggagat gctgataaaa 2 

85 gatcaagaga agaagatgga agacatgaag aagaggcatc acgaggagat atttgatctg 2 

86 gagaaagaat ttgatgaggc tttggaacag ctcatgtaca agcatggcct tcacaatgaa 2 

87 gatgattgag acaaaagtct ggtacacaag acaagactaa gtttctttgt tttgcttttg 3 

88 gtatgtcgga aagtaggaga tctgagagac tccatttaaa tactaggaca aatctaagga 3 

89 gattatagat tattatcctc caatttttag tagacggatc taaggaagca ttaagttctt 3 

90 gtgactaaaa ccaagtttcc ttagtatttt gttttttttt ggtaaaattt catatgaaag 3 

91 ttagacatat taccaaacgt cagagtgaat cacagaatgg caaatcaaaa tcatgttttt 3 

92 agaattttat atctacaaaa tatatgggta caaat 3 
94 <210> SEQ ID NO: 2 

< 2il.> LENGTH: 187 8 
12> TYPE: DNA 

Arabldopsis thaLiana 
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4 atgagttcta gggctggtcc aatgtctaag gaaaagaacg ttcagggtgg ttataggcct 

5 gaggttgaac agttggttca aggtttggca gggacgagac tggcttcttc acaagatgat 

6 ggaggagagt gggaggtcat ttccaagaag aacaagaaca aaccaggaaa cacttctgga 

7 aaaacttggg tttctcagaa ttcgaatcct cctagagctt ggggtggtca gcagcaaggg 
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RAW SEQUENCE LISTING DATE: 0 9/12/2 0 0 2 

PATENT APPLICATION: US/10/0 30 , 829 TIME: 15:54:19 

Input Set : A:\100303829SeqList.txt 
Output Set: N:\CRF4\09122002\J030829.raw 

108 agaggtagca acgtatctgg gagaggaaac aatgtatccg ggagaggtaa cggcaatggt 3n0 

109 cggggcattc aagctaacat atctggtcgg ggacgagcgt tgagcagaaa gtatgataac 360 

110 aactttgtgg cacccccacc tgtatctcgc cctcctttgg aaggaggatg gaattggcag 42o 
11 L gcaagaggag gttctgctca gcacacagct gtgcaggagt ttcctgacgt ggaggatgat 48u 

112 gtggataatg cttctgagga agagaatgat tccgatgctt tggatgattc tgatgacgac 54') 

113 cttgcaagtg atgattatga ctcggatgtg agtcaaaaga gccatggatc acgaaagcag 6nn 

114 aataagtggt tcaaaaagtt ctttggcagc ttggatagct tgtcgatcga gcagataaat 660 

115 gaaccacaga ggcagtggca ttgtccagct tgtcagaacg gacctggtgc catcgattgg 720 

116 tataacctgc accctctact agctcatgcg aggacaaaag gagctaggcg agttaagctc 78u 

117 catagagaat tggctgaagt tttagaaaag gatctacaga tgagaggcgc atctgtcatt 84U 

118 ccttgtggtg agatttatgg gcagtggaag ggtttgggtg aggatgaaaa ggattatgaa 90u 
11^ attgtctggc ctccaatggt catcatcatg aatactagac tggataagga cgataacgat 960 

120 aagtggctcg gcatgggcaa ccaagagctg ctggaatact tcgacaagta tgaggctctt 1020 

12 1 agagcacgcc attcctatgg tccacagggc catcgtggga tgagtgttct gatgtttgag 1080 

122 agcagtgcca ctggctattt ggaggccgaa cgcctccacc gggagttagc tgagatgggg 1140 
12 3 ttagatagaa ttgcctgggg tcagaagcgc agtatgtttt ctggaggtgt tcgccaactg 1200 
124 tatggcttcc ttgcaacgaa gcaagatctg gacatattca atcaacactc tcaaggcaaa 1260 

12 5 acaaggctga aattcgagtt gaaatcatac caagagatgg ttgtaaagga gctgaggcag 1320 
126 atctctgagg acaatcagca gctgaactac tttaagaaca agctctcaaa acagaacaag 1380 
1j7 cacgccaagg tgcttgagga atctctggaa attatgagcg agaagctgcg tagaactgca 1440 
U8 gaggataatc ggatcgtgag acagagaact aagatgcagc atgaacagaa cagggaagag 1500 

129 atggatgcac acgacaggtt tttcatggat tcaatcaaac agatccatga aagaagagac 1560 

130 gcaaaggagg agaatttcga gatgttgcag cagcaggaac gtgccaaggt tgttggccag 1620 

131 cagcagcaga acattaatcc ctctagcaat gacgattgcc gaaagagagc tgaggaagtg 1680 

132 tcaagcttca tcgagtttca agagaaagag atggaggagt ttgtggaaga gagggagatg 1740 

13 3 ctgataaaag atcaagagaa gaagatggaa gacatgaaga agaggcatca cgaggagata 1800 

134 tttgatctgg agaaagaatt tgatgaggct ttggaacagc tcatgtacaa gcatggcctt 1860 

135 cacaatgaag atgattga 187 8 

137 ^210> SEQ ID NO: 3 

138 <211> LENGTH: 62 5 

139 <:212> TYPE: PRT 

140 <:213> ORGANISM: Arabldopsis thaliana 

14 2 <4 00> SEQUENCE: 3 
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RAW SEQUENCE LISTING DATE: 09/12/2002 

PATENT APPLICATION: US/10/0 30 , 829 TIME: 15:54:19 

Input Set : A:\100303829SeqList.txt 
Output Set: N:\CRF4\09122002\J030829.raw 

159 Sf?r Arg Pro Pro Leu Glu Gly Gly Trp Asn Trp Gin Ala Arg Gly Gly 
HiO 130 135 140 

Ihl Ser Ala Gin His Thr Ala Val Gin Glu Phe Pro Asp Val Glu Asp Asp 
lh2 145 150 155 160 

lb 3 Val Asp Asn Ala Ser Glu Glu Glu Asn Asp Ser Asp Ala Leu Asp Asp 

164 165 170 175 

165 Ser Asp Asp Asp Leu Ala Ser Asp Asp Tyr Asp Ser Asp Val Ser Gin 
luf, 180 185 190 

167 Lys Ser His Gly Ser Arg Lys Gin Asn Lys Trp Phe Lys Lys Phe Phe 

168 195 200 205 

1 1'. 'H Gly Ser Leu Asp Ser Leu Ser He Glu Gin lie Asn Glu Pro Gin Arg 
170 210 215 220 

17 1 g I n Trp His Cys Pro Ala Cys Gin Asn G l y Pro Gly Ala He Asp Trp 
17J 225 230 235 240 

17 3 Tyr Asn Leu His Pro Leu Leu Ala His Ala Arg Thr Lys Gly Ala Arg 
174 245 2 r .O 255 

17 5 Arg Val Lys Leu His Arg Glu Leu Ala Glu Val Leu Glu Lys Asp Leu 

17 6 260 26 5 270 

17"7 Gin Met Arg Gly Ala Ser Val He Pro Cys Gly Glu He Tyr Gly Gin 

178 275 280 285 

179 Trp Lys Gly Leu Gly Glu Asp Glu Lys Asp Tyr Glu He Val Trp Pro 

180 290 295 300 

181 Pro Met Val He He Met Asn Thr Arg Leu Asp Lys Asp Asp Asn Asp 

182 3n5 310 315 320 

18 < Lys Trp Leu Gly Met Gly Asn Gin Glu Leu Leu Glu Tyr Phe Asp Lys 

184 325 330 335 

185 Tyr Glu Ala Leu Arg Ala Arg His Ser Tyr Gly Pro Gin Gly His Arg 

186 340 34 5 3 50 

187 Gly Met Ser Val Leu Met Phe Glu Ser Ser Ala Thr Gly Tyr Leu Glu 

188 355 360 365 

189 Ala Glu Arg Leu His Arg Glu Leu Ala Glu Met Gly Leu Asp Arg He 

190 370 375 380 

19.1. Ala Trp Gly Gin Lys Arg Ser Met Phe Ser Gly Gly Val Arg Gin Leu 
192 385 390 395 400 

19 3 Tyr Gly Phe Leu Ala Thr Lys Gin Asp Leu Asp He Phe Asn Gin His 
194 405 410 415 

IMS Ser Gin Gly Lys Thr Arg Leu Lys Phe Glu Leu Lys Ser Tyr Gin Glu 

1*1} 420 425 430 

lo~ M« -t Ya 1 Val Lys Glu Leu Arg G ; n lie Ser Glu Asp Asn Gin Gin Leu 

19m 4 35 440 44 5 

1 1 1 9 Asn Tyr Phe Lys Asn Lys Leu Ser Lys Gin Asn Lys His Ala Lys Val 

20u 450 455 460 

201 Leu Glu Glu Ser Leu Glu He Met Ser Glu Lys Leu Arg Arg Thr Ala 

202 4o5 470 475 480 

203 Glu Asp Asn Arg He Val Arg Gin Arg Thr Lys Met Gin His Glu Gin 

20 4 4 85 4 90 4 95 

205 Asn Arg Glu Glu Met Asp Ala His Asp Arg Phe Phe Met Asp Ser He 

206 500 505 510 

207 Lys Gin He His Glu Arg Arg Asp Ala Lys Glu Glu Asn Phe Glu Met 
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RAW SEQUENCE LISTING DATE 09/12/2002 

PATENT APPLICATION: US/10/0 3 0 , 82 9 TIME: 15:54:19 

Input Set : A:\100303829SeqList.txt 
Output Set: N:\CRF4\09122002\J030829.raw 

208 515 520 525 

209 Leu Gin Gin Gin Glu Arq Ala Lys Val Val Gly Gin Gin Gin Gin Asn 

210 530 535 540 

211 He Asn Pro Ser Ser Asn Asp Asp Cys Arg Lys Arg Ala Glu Glu Val 

212 545 550 555 560 

213 Ser Ser Phe He Glu Phe Gin Glu Lys Glu Met Glu Glu Phe Val Glu 

214 565 570 575 

215 Glu Arg Glu Met Leu He Lys Asp Gin Glu Lys Lys Met Glu Asp Met 

216 580 585 590 

217 Lys Lys Arg His His Glu Glu He Phe Asp Leu Glu Lys Glu Phe Asp 

218 59 5 6 00 605 

219 Glu Ala Leu Glu Gin Leu Met Tyr Lys His Gly Leu His Asn Glu Asp 

220 610 615 620 

221 Asp 

222 625 

225 <210> SEQ ID NO: 4 

226 02ll> LENGTH: 23 
.127 <212> TYPE: DNA 

228 -:213> ORGANISM: Artificial Sequence 
2 30 <J20> FEATURE: 

2 31 -:223> OTHER INFORMATION: Oligonucleotide p356AD' 
2 33 <4 0 0> SEQUENCE: 4 

2 34 aaaatgagtt ctagggctgg tec 23 
2 36 - 2l0> SEQ ID NO: 5 
2 37 -;2ll> LENGTH: 27 
J 38 -:212> TYPE: DNA 

239 <213:> ORGANISM: Artificial Sequence 

241 -220> FEATURE: 

242 v223> OTHER INFORMATION: Oligonucleotide p356Y* 
J44 .;400> SEQUENCE: 5 

245 gtctcaatca tcttcattgt gaaggee 2/ 
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VERIFICATION SUMMARY DATE: 09/12/2002 

PATENT APPLICATION: US/10/0 30,829 TIME: 15:54:20 

Input Set : A:\100303829SeqList.txt 
Output Sot: N:\CRF4\09122002\J030829.raw 

L:37 M:258 W: Mandatory Feature missing, <220> not found for SEQ ID#:1 
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